BUG: timedelta64(NaT) incorrectly treated as datetime in some dataframe ops #28049

jbrockmendel · 2019-08-21T01:00:31Z

…me ops

jbrockmendel · 2019-08-26T16:11:39Z

@TomAugspurger thoughts? lots of ops stuff in the works, getting really close to fixing the perf problem

pandas/core/ops/__init__.py

TomAugspurger

LGTM. Could a user hit this from normal code? (i.e. do we need a release note?)

jbrockmendel · 2019-09-03T16:19:24Z

Could a user hit this from normal code? (i.e. do we need a release note?)

Yes. e.g. the test this implements incorrectly raises TypeError on master. Will update with release note.

jbrockmendel · 2019-09-10T23:26:24Z

@jreback gentle ping

jreback

lgtm, just move the note to 1.0

jreback · 2019-09-17T12:37:15Z

doc/source/whatsnew/v0.25.2.rst

@@ -18,7 +18,7 @@ Categorical

 Datetimelike
 ^^^^^^^^^^^^
-
+- Bug in :class:`DataFrame` arithmetic operations when operating with a :class:`Series` with dtype `'timedelta64[ns]'` (:issue:`28049`)


can move to 1.0

updated+green

WillAyd

Minor clarification point. Was about to merge but want to double check before.

Note I fixed up unneeded changes to the 0.25.2 whatsnew so would need to pull locally if a change required here (or edit directly in GH)

WillAyd · 2019-09-18T16:30:46Z

pandas/core/ops/__init__.py

+            right = np.asarray(right)
+
+            def column_op(a, b):
+                return {i: func(a.iloc[:, i], b[i]) for i in range(len(a.columns))}


Just to double check - is the second argument to func supposed to be accessed via .iloc here as well?

Side note - an alternate approach for by column iteration is to call df.columns; not sure if there is a perf difference but have seen both in code base

is the second argument to func supposed to be accessed via .iloc here as well?

No, at this point b is an ndarray.

Side note - an alternate approach for by column iteration is to call df.columns; not sure if there is a perf difference but have seen both in code base

That runs in to difficulties if there are duplicate columns.

not sure if there is a perf difference

Might be worth looking at using iat instead of iloc for perf

WillAyd · 2019-09-18T16:59:47Z

Sounds good - merge away

…

Sent from my iPhone

On Sep 18, 2019, at 9:46 AM, jbrockmendel ***@***.***> wrote: @jbrockmendel commented on this pull request. In pandas/core/ops/__init__.py: > @@ -499,8 +499,19 @@ def column_op(a, b): # in which case we specifically want to operate row-by-row assert right.index.equals(left.columns) - def column_op(a, b): - return {i: func(a.iloc[:, i], b.iloc[i]) for i in range(len(a.columns))} + if right.dtype == "timedelta64[ns]": + # ensure we treat NaT values as the correct dtype + # Note: we do not do this unconditionally as it may be lossy or + # expensive for EA dtypes. + right = np.asarray(right) + + def column_op(a, b): + return {i: func(a.iloc[:, i], b[i]) for i in range(len(a.columns))} not sure if there is a perf difference Might be worth looking at using iat instead of iloc for perf — You are receiving this because you commented. Reply to this email directly, view it on GitHub, or mute the thread.

jreback · 2019-09-20T14:22:03Z

lgtm merge on green

jbrockmendel · 2019-09-20T23:01:12Z

@WillAyd gentle ping

WillAyd · 2019-09-20T23:36:19Z

Thanks @jbrockmendel

…me ops (pandas-dev#28049)

jbrockmendel added 4 commits August 20, 2019 17:59

BUG: timedelta64(NaT) incorrectly treated as datetime in some datafra…

3d24808

…me ops

Merge branch 'master' of https://github.com/pandas-dev/pandas into tdfix

653967e

Merge branch 'master' of https://github.com/pandas-dev/pandas into tdfix

376c157

Merge branch 'master' of https://github.com/pandas-dev/pandas into tdfix

b60f9f8

TomAugspurger reviewed Aug 26, 2019

View reviewed changes

pandas/core/ops/__init__.py Show resolved Hide resolved

Merge branch 'master' of https://github.com/pandas-dev/pandas into tdfix

ce0a36f

jreback requested changes Sep 2, 2019

View reviewed changes

pandas/core/ops/__init__.py Show resolved Hide resolved

jbrockmendel added 2 commits September 3, 2019 08:13

Merge branch 'master' of https://github.com/pandas-dev/pandas into tdfix

701fe31

comment

90b92a0

TomAugspurger added this to the 1.0 milestone Sep 3, 2019

TomAugspurger approved these changes Sep 3, 2019

View reviewed changes

jbrockmendel added 2 commits September 3, 2019 09:20

whatsnew

9c12001

Merge branch 'master' of https://github.com/pandas-dev/pandas into tdfix

e142f58

Merge branch 'master' of https://github.com/pandas-dev/pandas into tdfix

b865438

jreback approved these changes Sep 17, 2019

View reviewed changes

jbrockmendel and others added 3 commits September 17, 2019 07:16

Merge branch 'master' of https://github.com/pandas-dev/pandas into tdfix

71ef709

DOC: move whatsnew note to 1.0

86070e1

reverted whitespace in 0.25 whatsnew

e3514e4

WillAyd requested changes Sep 18, 2019

View reviewed changes

jbrockmendel added 2 commits September 20, 2019 07:13

Merge branch 'master' of https://github.com/pandas-dev/pandas into tdfix

222e636

Merge branch 'tdfix' of github.com:jbrockmendel/pandas into tdfix

fa48c27

jbrockmendel mentioned this pull request Sep 20, 2019

TST: parametrize test_expressions #28493

Merged

WillAyd approved these changes Sep 20, 2019

View reviewed changes

WillAyd merged commit f08a1e6 into pandas-dev:master Sep 20, 2019

jbrockmendel deleted the tdfix branch September 20, 2019 23:45

proost pushed a commit to proost/pandas that referenced this pull request Dec 19, 2019

BUG: timedelta64(NaT) incorrectly treated as datetime in some datafra…

623566f

…me ops (pandas-dev#28049)

proost pushed a commit to proost/pandas that referenced this pull request Dec 19, 2019

BUG: timedelta64(NaT) incorrectly treated as datetime in some datafra…

e3838b3

…me ops (pandas-dev#28049)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BUG: timedelta64(NaT) incorrectly treated as datetime in some dataframe ops #28049

BUG: timedelta64(NaT) incorrectly treated as datetime in some dataframe ops #28049

jbrockmendel commented Aug 21, 2019

jbrockmendel commented Aug 26, 2019

TomAugspurger left a comment

jbrockmendel commented Sep 3, 2019

jbrockmendel commented Sep 10, 2019

jreback left a comment

jreback Sep 17, 2019

jbrockmendel Sep 17, 2019

WillAyd left a comment

WillAyd Sep 18, 2019

jbrockmendel Sep 18, 2019

jbrockmendel Sep 18, 2019

WillAyd commented Sep 18, 2019 via email

jreback commented Sep 20, 2019

jbrockmendel commented Sep 20, 2019

WillAyd commented Sep 20, 2019

BUG: timedelta64(NaT) incorrectly treated as datetime in some dataframe ops #28049

BUG: timedelta64(NaT) incorrectly treated as datetime in some dataframe ops #28049

Conversation

jbrockmendel commented Aug 21, 2019

jbrockmendel commented Aug 26, 2019

TomAugspurger left a comment

Choose a reason for hiding this comment

jbrockmendel commented Sep 3, 2019

jbrockmendel commented Sep 10, 2019

jreback left a comment

Choose a reason for hiding this comment

jreback Sep 17, 2019

Choose a reason for hiding this comment

jbrockmendel Sep 17, 2019

Choose a reason for hiding this comment

WillAyd left a comment

Choose a reason for hiding this comment

WillAyd Sep 18, 2019

Choose a reason for hiding this comment

jbrockmendel Sep 18, 2019

Choose a reason for hiding this comment

jbrockmendel Sep 18, 2019

Choose a reason for hiding this comment

WillAyd commented Sep 18, 2019 via email

jreback commented Sep 20, 2019

jbrockmendel commented Sep 20, 2019

WillAyd commented Sep 20, 2019